Summarizing Spoken Documents: avoiding distracting content

Authors

  • Ricardo Ribeiro
  • David Martins de Matos
Abstract

Driven by a cognitive perspective of the human summarization process, we address the problem of assessing the most relevant information of a single spoken language document by minimizing the influence of distracting content, of which passages particularly affected by spoken language-related problems are major representatives. Two different approaches are considered. One, based only on the input source to be summarized, consists of a centrality-based relevance model for automatic summarization that uses support sets to better estimate the relevant content. Geometric proximity is used to compute semantic relatedness. Relevance is determined by considering the whole input source, and by assuming that information sources to be summarized comprise different topics. A thorough evaluation shows statistically significant improvements over previous approaches. The other mimics natural human behavior, in which information acquired from different sources is used to build a better understanding of a given topic. Information from different types of sources and of the same type is explored. A multi-document summarization framework provides the means to assess the relevant content. A perceptual evaluation shows that mixing information leads to considerably better results, both in terms of informativeness and readability. Concerning the use of information of the same type, results show that background information on the same topic clearly improves the detection of the most important content.
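
The first approach in the abstract (support sets, with geometric proximity as semantic relatedness) can be illustrated with a minimal sketch. The abstract does not give the passage representation, the distance metric, or how support sets are sized, so everything concrete here is an assumption made for illustration: raw term-frequency vectors, cosine distance, a fixed support-set size `k`, and the hypothetical function names `support_sets` and `rank_passages`. Passages are ranked by how many support sets they appear in, so a passage that is close to nothing else (for example, a heavily disfluent one) tends to fall to the bottom.

```python
import math
from collections import Counter


def tf_vector(passage):
    # Assumption: bag-of-words term frequencies over whitespace tokens.
    return Counter(passage.lower().split())


def cosine_distance(a, b):
    # Geometric proximity stand-in: 1 - cosine similarity of two term vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def support_sets(passages, k):
    # Support set of passage i: the k other passages closest to it.
    # Assumption: fixed cardinality k; ties keep original passage order.
    vectors = [tf_vector(p) for p in passages]
    result = []
    for i, vi in enumerate(vectors):
        others = [j for j in range(len(vectors)) if j != i]
        others.sort(key=lambda j: cosine_distance(vi, vectors[j]))
        result.append(set(others[:k]))
    return result


def rank_passages(passages, k=2):
    # Relevance of a passage: the number of support sets that contain it.
    counts = Counter()
    for members in support_sets(passages, k):
        counts.update(members)
    return sorted(range(len(passages)), key=lambda i: -counts[i])


if __name__ == "__main__":
    # Hypothetical toy transcript; passage 2 imitates a disfluent stretch.
    passages = [
        "the budget vote was delayed",
        "parliament delayed the budget vote again",
        "uh well you know so",
        "the budget vote is expected next week",
    ]
    for index in rank_passages(passages, k=2):
        print(index, passages[index])
```

In this toy input the disfluent passage shares no terms with the others, so it appears in no support set and is ranked last. A summary would then be built from the top-ranked passages up to the desired length.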

Similar resources

Summarizing multiple spoken documents: finding evidence from untranscribed audio

This paper presents a model for summarizing multiple untranscribed spoken documents. Without assuming the availability of transcripts, the model modifies a recently proposed unsupervised algorithm to detect re-occurring acoustic patterns in speech and uses them to estimate similarities between utterances, which are in turn used to identify salient utterances and remove redundancies. This model ...

Hierarchical topic organization and visual presentation of spoken documents using probabilistic latent semantic analysis (PLSA) for efficient retrieval/browsing applications

The most attractive form of future network content will be multi-media including speech information, and such speech information usually carries the core concepts for the content. As a result, the spoken documents associated with the multi-media content very possibly can serve as the key for retrieval and browsing. This paper presents a new approach of hierarchical topic organization and visual...

Summarizing Speech Without Text Using Hidden Markov Models

We present a method for summarizing speech documents without using any type of transcript/text in a Hidden Markov Model framework. The hidden variables or states in the model represent whether a sentence is to be included in a summary or not, and the acoustic/prosodic features are the observation vectors. The model predicts the optimal sequence of segments that best summarize the document. We e...

Semantic Similarity for Detecting Recognition Errors in Automatic Speech Transcripts

Browsing through large volumes of spoken audio is known to be a challenging task for end users. One way to alleviate this problem is to allow users to gist a spoken audio document by glancing over a transcript generated through Automatic Speech Recognition. Unfortunately, such transcripts typically contain many recognition errors which are highly distracting and make gisting more difficult. In ...

Multi-layered Summarization of Spo Information Extraction and S

The spoken documents are very difficult to be shown on the screen, and very difficult to retrieve and browse. It is therefore important to develop technologies to summarize the entire archives of the huge quantities of spoken documents in the network content to help the user in browsing and retrieval. In this paper we propose a complete set of multi-layered technologies to handle at least some ...

Publication date: 2012